QUANTITATIVE CONCENTRATION INEQUALITIES ON SAMPLE 
PATH SPACE FOR MEAN FIELD INTERACTION 



FRANÇOIS BOLLEY

Abstract. We consider a system of particles experiencing diffusion and mean field inter- 
action, and study its behaviour when the number of particles goes to infinity. We derive 
non-asymptotic large deviation bounds measuring the concentration of the empirical mea- 
sure of the paths of the particles around its limit. The method is based on a coupling 
argument, strong integrability estimates on the paths in Hölder norm, and some general
concentration result for the empirical measure of identically distributed independent paths. 

Introduction

This paper is devoted to the study of the behaviour of some large stochastic particle 
system. In the models to be considered, the evolution of each particle is governed by a 
random diffusive term, an exterior force field and a mean field interaction with the other 
particles. For such models the limit behaviour has been clearly identified and studied in 
terms of law of large numbers, central limit theorem and large deviations. Here we shall 
give new quantitative estimates on the convergence in the setting of large deviations. 

This follows some works addressing this issue at the level of observables or at the level of 
the whole system at a given time, that we now summarize. For that purpose let $(X_t^i)_{1 \le i \le N}$
be the position at time $t$ of the $N$ particles in the phase space $\mathbb{R}^d$ and let $\mu_t$ be some
probability measure describing the limit behaviour of the system. At the level of Lipschitz
observables, F. Malrieu adapted ideas of concentration of measure to obtain bounds like

$$\ldots \;\le\; 2\, e^{-\lambda N \varepsilon^2}, \qquad N \ge 1, \tag{1}$$

where $C$ and $\lambda$ are some constants independent of $\varepsilon$ and $N$, and $[\,\cdot\,]_1$ is the Lipschitz
seminorm defined by

$$[\varphi]_1 := \sup_{x \ne y} \frac{|\varphi(x) - \varphi(y)|}{|x - y|}.$$

In other words, letting $\delta_x$ stand for the Dirac mass at a point $x \in \mathbb{R}^d$, the empirical measure

$$\hat\mu_t^N := \frac{1}{N} \sum_{i=1}^N \delta_{X_t^i}$$

of the system, which generates the observables at time $t$, satisfies the deviation inequality

$$\ldots \;\le\; 2\, e^{-\lambda N \varepsilon^2}, \qquad N \ge 1.$$

Now one can measure how close this empirical measure $\hat\mu_t^N$ is to its limit $\mu_t$ in a stronger
sense, namely, at the very level of the measures. For this, adapting Sanov's large deviation
argument, the authors in [6] obtained quantitative and non-asymptotic bounds on the deviation
of $\hat\mu_t^N$ around $\mu_t$ for some distance which induces a topology stronger than the narrow
topology. By comparison with (1), these bounds can be written as

$$\ldots \;\le\; C(\varepsilon)\, e^{-\lambda N \varepsilon^2}, \qquad N \ge 1. \tag{2}$$

In this work we want to go one step further by considering the trajectories of the
particles. A natural object to consider is the empirical measure of the trajectories $(X_t^i)_{0 \le t \le T}$
on some given time interval $[0,T]$, which is defined as

$$\hat\mu^N_{[0,T]} := \frac{1}{N} \sum_{i=1}^N \delta_{(X_t^i)_{0 \le t \le T}}$$

where $\delta_{(X_t^i)_{0 \le t \le T}}$ is the Dirac mass on the path $(X_t^i)_{0 \le t \le T}$. This is a random probability
measure, no longer on the phase space $\mathbb{R}^d$, but now on the path space, which in our model
is the space of $\mathbb{R}^d$-valued continuous functions on $[0,T]$.

Its limit behaviour is given as follows: the limit $\mu_t$ of the empirical measure $\hat\mu_t^N$ at time
$t$ can be seen as the law of the solution at time $t$ to a stochastic differential equation; then
the law of the whole process on $[0,T]$ so defined will be the limit of $\hat\mu^N_{[0,T]}$. We shall give a
precise meaning to this convergence, and some estimates which are the analogue of (2) in
the path space; in particular we shall see that they imply (2) by projection at time $t$.

In the next section we state our main results and give an outline of the proofs, which
will be given in more detail in the following sections.

1. Statement of the results

1.1. Some notation and definitions. One of the key points in this work is to measure
the discrepancy between probability measures: this will be done by means of Wasserstein
distances, which have proved convenient for this type of issue and are defined as follows.
Let $(X,d)$ be a separable and complete metric space, and $p$ be a real number $\ge 1$; the
Wasserstein distance of order $p$ between two Borel probability measures $\mu$ and $\nu$ on $X$ is

$$W_p(\mu,\nu) := \inf_{\pi} \Big( \iint_{X \times X} d(x,y)^p \, d\pi(x,y) \Big)^{1/p}$$

where $\pi$ runs over the set of all joint measures on $X \times X$ with marginals $\mu$ and $\nu$.
$W_p$ induces a metric on the set of Borel probability measures on $X$ with finite moment
$\int_X d(x_0,x)^p \, d\mu(x)$ for some (and thus any) $x_0$ in $X$; convergence in this metric is
equivalent to narrow convergence (against bounded continuous functions) plus some tightness
condition on the moments (see for instance [?] for further details on these distances).
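
The definition above can be made concrete in the simplest possible setting. The following is a minimal sketch, assuming $X = \mathbb{R}$ with the usual distance and two samples of the same size: in that special case an optimal coupling between the two empirical measures pairs the sorted values, so no optimization is needed. The function name `wasserstein_1d` is ours and does not come from the paper.

```python
def wasserstein_1d(xs, ys, p=1.0):
    """W_p distance between the empirical measures of two equal-size samples on R.

    Assumption (illustration only): one-dimensional data and equal sample
    sizes, so that pairing order statistics gives an optimal coupling.
    """
    if len(xs) != len(ys) or not xs:
        raise ValueError("samples must be non-empty and of equal size")
    if p < 1:
        raise ValueError("p must be >= 1")
    # Pair the i-th smallest value of xs with the i-th smallest value of ys.
    cost = sum(abs(x - y) ** p for x, y in zip(sorted(xs), sorted(ys)))
    return (cost / len(xs)) ** (1.0 / p)


if __name__ == "__main__":
    # Shifting every point by 1 moves the empirical measure by exactly 1.
    print(wasserstein_1d([0.0, 1.0], [1.0, 2.0], p=1))
```

On the path space considered in the sequel no such shortcut is available, which is one reason why the deviation bounds below are useful.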

At some point the space $(X,d)$ will be $\mathbb{R}^d$ equipped with the Euclidean distance $|\cdot|$, and
in this context $W_p$ will be denoted $W_{p,\tau}$. But $(X,d)$ will mainly be the space $C([0,T],\mathbb{R}^d)$,
also denoted $C$ if no confusion is possible, of $\mathbb{R}^d$-valued continuous functions on $[0,T]$,
equipped with the uniform norm

$$\|f\|_\infty := \sup_{0 \le t \le T} |f(t)|;$$

for this space $W_p$ will be denoted $W_{p,[0,T]}$. The Wasserstein distances considered in these
two situations are linked in the following way: if, for $0 \le t \le T$, $\pi_t$ is the projection from
$C$ into $\mathbb{R}^d$ defined by $\pi_t(f) = f(t)$, then for any Borel probability measures $\mu$ and $\nu$ on $C$,
and any $p \ge 1$, the relation

$$W_{p,\tau}(\pi_t \# \mu, \pi_t \# \nu) \le W_{p,[0,T]}(\mu,\nu), \qquad 0 \le t \le T \tag{3}$$

holds, where $\pi_t \# \mu$ is the image measure of $\mu$ by $\pi_t$.

The distance between two probability measures $\mu$ and $\nu$ on $X$ can also be expressed in
terms of the relative entropy of $\nu$ with respect to $\mu$ (for instance), defined by

$$H(\nu|\mu) = \int_X \frac{d\nu}{d\mu} \ln \frac{d\nu}{d\mu} \, d\mu$$

if $\nu$ is absolutely continuous with respect to $\mu$, and $H(\nu|\mu) = +\infty$ otherwise.

Both notions are linked by the family of transportation or Talagrand inequalities: given
$p \ge 1$ and $\lambda > 0$, we say that a probability measure $\mu$ on $X$ satisfies the inequality $T_p(\lambda)$ if

$$W_p(\mu,\nu) \le \sqrt{\frac{2}{\lambda} H(\nu|\mu)}$$

holds true for any measure $\nu$, and $\mu$ satisfies $T_p$ if it satisfies $T_p(\lambda)$ for some $\lambda > 0$. By
Jensen's inequality, the weakest of all is $T_1$, which is also the only one for which a simple
characterization is known: a measure $\mu$ satisfies $T_1(\lambda)$ for some $\lambda > 0$ if and only if it admits
a square-exponential moment, in the sense that there exist $a > 0$ and $x_0$ in $X$ such that
$\int_X e^{a\, d(x_0,x)^2} \, d\mu(x)$ is finite. Numerical relations between such $a$ and $\lambda$ are given in [?].

1.2. A general concentration inequality for empirical measures. The proof of our 
main theorem on the particle system is based on some general concentration result for the 
empirical measure of C-valued independent and identically distributed random variables. 
We state this result separately. 

For this purpose, given some Borel probability measure $\mu$ on $C$ and $N$ independent
random variables $(X^i)_{1 \le i \le N}$ with law $\mu$, we let

$$\hat\mu^N := \frac{1}{N} \sum_{i=1}^N \delta_{X^i}$$

denote their empirical measure.

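Given some real number $\alpha \in (0,1]$, we let $C^\alpha := C^\alpha([0,T],\mathbb{R}^d)$ be the space of functions
in $C := C([0,T],\mathbb{R}^d)$ which moreover are Hölder of order $\alpha$, equipped with the Hölder norm

$$\|f\|_\alpha := \sup\big( \|f\|_\infty, [f]_\alpha \big)$$

where

$$[f]_\alpha := \sup_{0 \le t, s \le T} \frac{|f(t) - f(s)|}{|t - s|^\alpha}.$$

The space $C^\alpha$ is a Borel set of the space $C$ equipped with the topology induced by the uniform norm,
and for Borel measures on $C$, concentrated on $C^\alpha$, we have in the above notation:

Theorem 1. Let $p \in [1,2]$ and let $\mu$ be a Borel probability measure on $C$ satisfying a $T_p(\lambda)$
inequality for some $\lambda > 0$, and such that $\int_C e^{a \|x\|_\alpha^2} \, d\mu(x)$ is finite for some $a > 0$ and
$\alpha \in (0,1]$. Then, for any $\alpha' < \alpha$ and $\lambda' < \lambda$, there exists some constant $N_0$ such that

$$P\big[ W_{p,[0,T]}(\mu, \hat\mu^N) > \varepsilon \big] \le e^{-\gamma_p \frac{\lambda'}{2} N \varepsilon^2} \tag{4}$$

for any $\varepsilon > 0$ and $N \ge N_0\, \varepsilon^{-2} \exp\big( N_0\, \varepsilon^{-1/\alpha'} \big)$, where

$$\gamma_p = \begin{cases} 1 & \text{if } 1 \le p < 2 \\ \big( 1 + \sqrt{\lambda/a} \big)^{-2} & \text{if } p = 2. \end{cases}$$

Here the constant $N_0$ depends on $\mu$ only through $\lambda$, $a$, $\alpha$ and $\int_C e^{a \|x\|_\alpha^2} \, d\mu(x)$.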
Let us make a few remarks on this result. 

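First of all, for another formulation of the obtained bound in the case when $p = 1$, we
recall the Kantorovich-Rubinstein dual expression of the $W_1$ distance on a general space $(X,d)$:

$$W_1(\mu,\nu) = \sup_{[\varphi]_1 \le 1} \Big\{ \int_X \varphi \, d\mu - \int_X \varphi \, d\nu \Big\} \tag{5}$$

where

$$[\varphi]_1 := \sup_{x \ne y} \frac{|\varphi(x) - \varphi(y)|}{d(x,y)}.$$

Then a result by S. Bobkov and F. Götze [3] ensures that
a $T_1(\lambda)$ inequality for $\mu$ is equivalent to the concentration inequality

$$\sup_{[\varphi]_1 \le 1} P\Big[ \frac{1}{N} \sum_{i=1}^N \varphi(X^i) - \int_X \varphi \, d\mu > \varepsilon \Big] \le e^{-\frac{\lambda}{2} N \varepsilon^2}, \qquad N \ge 1.$$

By comparison, the bound given by Theorem 1 implies

$$P\Big[ \sup_{[\varphi]_1 \le 1} \Big( \frac{1}{N} \sum_{i=1}^N \varphi(X^i) - \int_C \varphi \, d\mu \Big) > \varepsilon \Big] \le e^{-\frac{\lambda'}{2} N \varepsilon^2}, \qquad \lambda' < \lambda, \ N \text{ large enough}$$

but a modification of the proof would also lead to

$$P\Big[ \sup_{[\varphi]_1 \le 1} \Big( \frac{1}{N} \sum_{i=1}^N \varphi(X^i) - \int_C \varphi \, d\mu \Big) > \varepsilon \Big] \le C(\varepsilon)\, e^{-\frac{\lambda'}{2} N \varepsilon^2}, \qquad \lambda' < \lambda, \ N \ge 1, \tag{6}$$

for some computable large constant $C(\varepsilon)$. In other words we control a much stronger
quantity, up to some loss on the constant in the right-hand side, or some condition on the
size of the sample.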

This result seems reasonable in view of Sanov's theorem (stated in [10] for instance). By
applying this theorem to $A := \{\nu;\ W_{p,[0,T]}(\nu,\mu) \ge \varepsilon\}$, for some given $\varepsilon > 0$, one can hope
for an upper bound like

$$P\big[ W_{p,[0,T]}(\mu, \hat\mu^N) \ge \varepsilon \big] \le \exp\big( - N \inf\{ H(\nu|\mu);\ \nu \in A \} \big)$$

for large $N$. With this bound in hand, since

$$\inf\{ H(\nu|\mu);\ \nu \in A \} \ge \frac{\lambda}{2} \varepsilon^2$$

as $\mu$ satisfies $T_p(\lambda)$, one indeed obtains an upper bound like (4), but only in an asymptotic
way, whereas Theorem 1 moreover gives an estimate on a sufficient size of the sample
for the deviation bound to hold. Sanov's theorem does not actually give such an upper
bound here; indeed, on an unbounded space such as $C$, the closure $\bar{A}$ of $A$ (for the narrow
topology) contains $\mu$ itself: in particular $\inf\{ H(\nu|\mu);\ \nu \in \bar{A} \} = 0$ and Sanov's theorem
only gives the trivial upper bound $P[\hat\mu^N \in A] \le \exp\big( - N \inf\{ H(\nu|\mu);\ \nu \in \bar{A} \} \big) = 1$.

To get a more relevant upper bound we impose some extra integrability assumption that
may at first sight seem strong and odd. The reason is that, proceeding as in [6], we first
reduce the issue to a compact set of $C$ that has almost full $\mu$ measure: a large ball of $C^\alpha$ will
do by Ascoli's theorem and the integrability assumption on $\mu$. Then on this compact set
one can get precise upper bounds by using some techniques based on a covering argument
and developed in [13] (see also [10, Exercises 4.4.5 and 6.2.19] and [?]). And actually
the assumption is satisfied by the Wiener measure on $C$ (recall that the Brownian motion
paths are almost surely Hölder of order $\alpha$ for any $\alpha < 1/2$) and by extension by the law of
the process to be considered.
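
To make the Hölder norm entering this integrability assumption more tangible, here is a minimal sketch of its discrete analogue for a real-valued path sampled on a uniform grid of $[0,T]$. The function name `holder_norm`, the restriction to dimension one, and the restriction of the supremum to grid points are our assumptions; the value returned is therefore only a lower bound for the true norm $\|f\|_\alpha$.

```python
def holder_norm(path, T, alpha):
    """Discrete analogue of ||f||_alpha = max(||f||_inf, [f]_alpha).

    `path` holds the values f(k * T / n), k = 0..n, of a real-valued function.
    Only pairs of grid points are compared (illustration only).
    """
    n = len(path) - 1
    if n < 1:
        raise ValueError("need at least two sample points")
    if not 0 < alpha <= 1:
        raise ValueError("alpha must lie in (0, 1]")
    dt = T / n
    sup_norm = max(abs(v) for v in path)
    # Hoelder seminorm: largest ratio |f(t) - f(s)| / |t - s|^alpha on the grid.
    seminorm = max(
        abs(path[j] - path[i]) / ((j - i) * dt) ** alpha
        for i in range(n)
        for j in range(i + 1, n + 1)
    )
    return max(sup_norm, seminorm)


if __name__ == "__main__":
    line = [0.0, 0.25, 0.5, 0.75, 1.0]  # f(t) = t on [0, 1]
    print(holder_norm(line, T=1.0, alpha=0.5))
```

The double loop costs a number of operations quadratic in the number of grid points, which is acceptable for short illustrative paths.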

This integrability assumption again implies the existence of a square-exponential moment
for $\mu$ on $C$ (for the uniform norm). Since this is equivalent to some $T_1$ inequality for $\mu$, the
$T_p(\lambda)$ assumption is redundant when $p = 1$ if one does not care about the constants involved.

Finally this result can be seen as an extension of the following similar concentration
result given in [6, Theorem 1.1] in the case of measures on $\mathbb{R}^d$: if $m$ satisfies $T_p(\lambda)$, then

$$P\big[ W_{p,\tau}(m, \hat m^N) > \varepsilon \big] \le e^{-\gamma_p \frac{\lambda'}{2} N \varepsilon^2}, \qquad \varepsilon > 0, \quad N \ge N_0 \max\big( \varepsilon^{-(d'+2)}, 1 \big). \tag{7}$$

Let indeed $m$ be such a measure on $\mathbb{R}^d$. Then the law of a constant process on $[0,T]$
initially distributed according to $m$ satisfies the assumptions of Theorem 1 (one can take
any $a < \lambda/2$), and the bound (7) follows by (4) with the constant $\gamma_p$ obtained in [6]. Note
however that the required size of the sample is here much larger for small $\varepsilon$.
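
As a rough numerical illustration of this last remark, the sketch below evaluates the two sample-size thresholds, $N_0\, \varepsilon^{-2} \exp(N_0\, \varepsilon^{-1/\alpha'})$ on path space and $N_0 \max(\varepsilon^{-(d'+2)}, 1)$ in finite dimension. The constants $N_0$ are not explicit in the statements, so the values used here (`n0 = 1.0`, `alpha_prime = 0.4`, `d_prime = 3.0`) are purely hypothetical placeholders, as are the function names.

```python
import math


def sample_size_paths(eps, n0, alpha_prime):
    """Shape of the path-space threshold N0 * eps^-2 * exp(N0 * eps^(-1/alpha'))."""
    return n0 * eps ** -2 * math.exp(n0 * eps ** (-1.0 / alpha_prime))


def sample_size_finite_dim(eps, n0, d_prime):
    """Shape of the finite-dimensional threshold N0 * max(eps^-(d'+2), 1)."""
    return n0 * max(eps ** -(d_prime + 2), 1.0)


if __name__ == "__main__":
    # Hypothetical constants, chosen only to compare the growth of both thresholds.
    for eps in (1.0, 0.5, 0.2, 0.1):
        paths = sample_size_paths(eps, n0=1.0, alpha_prime=0.4)
        finite = sample_size_finite_dim(eps, n0=1.0, d_prime=3.0)
        print(eps, paths, finite)
```

The exponential factor dominates any power of $\varepsilon^{-1}$ as $\varepsilon$ decreases, which is the price paid for working on an infinite-dimensional space.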

Theorem 1 will be proved in Section 2.

1.3. Interacting particle systems. We now turn to the study of a system of $N$ stochastic
interacting particles whose positions $X_t^i$ in the phase space $\mathbb{R}^d$ ($1 \le i \le N$) evolve according
to the system of coupled stochastic differential equations

$$dX_t^i = \sqrt{2}\, dB_t^i - \nabla V(X_t^i)\, dt - \frac{1}{N} \sum_{j=1}^N \nabla W(X_t^i - X_t^j)\, dt, \qquad 1 \le i \le N. \tag{8}$$

Here the $B^i$'s are $N$ standard independent Brownian motions on $\mathbb{R}^d$, and $V$ and $W$ are exterior
and interaction potentials.
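
For readers who wish to experiment with this particle system, the following is a minimal sketch of an Euler-Maruyama discretization in dimension one. The time-stepping scheme, the function name `simulate_particles` and the quadratic potentials used in the example are our assumptions and are not taken from the paper; the scheme only approximates the solutions of the stochastic differential equations.

```python
import math
import random


def simulate_particles(x0, grad_v, grad_w, T, n_steps, rng):
    """Euler-Maruyama paths for the one-dimensional particle system
    dX^i = sqrt(2) dB^i - V'(X^i) dt - (1/N) sum_j W'(X^i - X^j) dt.

    Returns one list of n_steps + 1 positions per particle.
    """
    n = len(x0)
    dt = T / n_steps
    paths = [[x] for x in x0]
    for _ in range(n_steps):
        current = [p[-1] for p in paths]  # positions at the current time
        for i, xi in enumerate(current):
            # Mean field interaction: average force exerted by all particles.
            interaction = sum(grad_w(xi - xj) for xj in current) / n
            noise = math.sqrt(2.0 * dt) * rng.gauss(0.0, 1.0)
            paths[i].append(xi + noise - (grad_v(xi) + interaction) * dt)
    return paths


if __name__ == "__main__":
    rng = random.Random(0)
    start = [rng.gauss(0.0, 1.0) for _ in range(20)]
    # Hypothetical choice: V(x) = x^2 / 2 and W(z) = z^2 / 2 (W is symmetric).
    paths = simulate_particles(start, lambda x: x, lambda z: z, 1.0, 100, rng)
    print(len(paths), len(paths[0]))
```

Each time step costs a number of operations quadratic in $N$ because of the pairwise interaction; for the quadratic $W$ of the example the interaction reduces to the distance to the empirical mean and could be computed in linear time.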

The state of the system at some given time $t$ is given by the observables $\frac{1}{N} \sum_{i=1}^N \varphi(X_t^i)$,
and thus can be described by the random probability measure

$$\hat\mu_t^N := \frac{1}{N} \sum_{i=1}^N \delta_{X_t^i}$$

on the phase space $\mathbb{R}^d$, called the empirical measure of the system. Under some regularity
and growth assumptions on the potentials $V$ and $W$, and if the particles are initially
distributed in a chaotic way, for instance as independent and identically distributed variables,
then $\hat\mu_t^N$ converges to a solution at time $t$ to the partial differential equation

$$\frac{\partial \mu_t}{\partial t} = \Delta \mu_t + \nabla \cdot \big( \mu_t \nabla (V + W * \mu_t) \big) \tag{9}$$

as the number $N$ of particles goes to infinity. This nonlinear diffusive equation, in which
$\Delta$, $\nabla\cdot$ and $\nabla$ respectively denote the Laplace, divergence and gradient operators in $\mathbb{R}^d$, is a
McKean-Vlasov equation and has been used in [3] in the modelling of one-dimensional
granular media. The convergence of $\hat\mu_t^N$ is strongly linked with the phenomenon of propagation
of chaos for the interacting particles, and both issues have been studied by H. Tanaka [?],
A.-S. Sznitman [?], S. Méléard [?] or S. Benachour, B. Roynette, D. Talay and P. Vallois
[1, 2] for instance. Then quantitative estimates on this convergence have been obtained by
F. Malrieu [?] at the level of observables, and later at the very level of the law in [6].

In this work we go one step further and study the limit behaviour of the empirical measure

$$\hat\mu^N_{[0,T]} := \frac{1}{N} \sum_{i=1}^N \delta_{X^i}$$

of the trajectories $X^i = (X_t^i)_{0 \le t \le T}$ of the particles on some time interval $[0,T]$.

For that purpose, let $Y = (Y_t)_{0 \le t \le T}$ be a solution to the stochastic differential equation

$$dY_t = \sqrt{2}\, dB_t - \nabla V(Y_t)\, dt - \nabla W * \nu_t(Y_t)\, dt \tag{10}$$

starting at $Y_0$ distributed according to the initial condition $\mu_0$ in (9), where $\nu_t$ is the law
of $Y_t$ at time $t$. Then, by Itô's formula, $\nu_t$ also is a solution to equation (9) with initial
datum $\mu_0$, and a uniqueness result ensures that actually $\nu_t = \mu_t$. In other words the limit
behaviour $\mu_t$ of $\hat\mu_t^N$ is the time-marginal of the law $\mu_{[0,T]}$ of the process $Y$, in the sense that
it is the image measure of $\mu_{[0,T]}$ by the canonical projection $\pi_t$ defined on the path space $C$
by $\pi_t(f) = f(t)$. Since $\hat\mu_t^N$ is the time-marginal of the considered empirical measure $\hat\mu^N_{[0,T]}$,
it is natural to hope that $\hat\mu^N_{[0,T]}$ converges (in some sense) to $\mu_{[0,T]}$. This convergence has
indeed been proved in the first works mentioned above, and here we want to extend to
this new setting the techniques developed in [?] and more particularly in [6].

We shall assume that the potentials $V$ and $W$ are twice differentiable on $\mathbb{R}^d$, with
bounded Hessian matrices in the sense that there exist real numbers $\beta$, $\beta'$, $\gamma$ and $\gamma'$ such that

$$\beta I \le D^2 V(x) \le \beta' I, \qquad \gamma I \le D^2 W(x) \le \gamma' I, \qquad x \in \mathbb{R}^d. \tag{11}$$

In other words the force fields $\nabla V$ and $\nabla W$ are assumed to be Lipschitz on the whole $\mathbb{R}^d$.

Under these assumptions, global existence and uniqueness, pathwise and in law, of the
solutions to (8) and (10) are proven in [?] for instance for square integrable initial data;
moreover the paths are continuous (in time). We shall also assume that the potential $W$,
which gives rise to an interaction term, is symmetric in the sense that $W(-z) = W(z)$ for
all $z \in \mathbb{R}^d$. Then we shall prove

Theorem 2. Let $\mu_0$ be a probability measure on $\mathbb{R}^d$, admitting a finite square-exponential
moment in the sense that there exists $a_0 > 0$ such that $\int_{\mathbb{R}^d} e^{a_0 |x|^2} \, d\mu_0(x)$ is finite. Let
$(X_0^i)_{1 \le i \le N}$ be $N$ independent random variables with common law $\mu_0$. Given $T > 0$, let
$(X^i)_{1 \le i \le N}$ be the solution of (8) on $[0,T]$ with initial value $(X_0^i)_{1 \le i \le N}$, where $V$ and $W$ are assumed
to satisfy (11); let also $\hat\mu^N_{[0,T]}$ be the empirical measure associated with the $N$ paths $X^i$. Let
finally $\mu_{[0,T]}$ be the law of the process solution of (10) for some initial value distributed
according to $\mu_0$.

Then, for any $\alpha \in (0,1/2)$, there exist some positive constants $K$ and $N_0$ such that

$$P\big[ W_{1,[0,T]}(\mu_{[0,T]}, \hat\mu^N_{[0,T]}) > \varepsilon \big] \le e^{-K N \varepsilon^2}$$

for all $\varepsilon > 0$ and $N \ge N_0\, \varepsilon^{-2} \exp\big( N_0\, \varepsilon^{-1/\alpha} \big)$.

Here the constants $K$ and $N_0$ depend on $T$, $V$, $W$, $\alpha$ and $\int_{\mathbb{R}^d} e^{a_0 |x|^2} \, d\mu_0(x)$.
By the Kantorovich-Rubinstein formulation again, this bound can be written as

$$P\Big[ \sup_{[\varphi]_1 \le 1} \Big( \frac{1}{N} \sum_{i=1}^N \varphi(X^i) - \int_C \varphi(x) \, d\mu_{[0,T]}(x) \Big) > \varepsilon \Big] \le e^{-K N \varepsilon^2}. \tag{12}$$

By projection at time $t$, it implies concentration inequalities for the time-marginals of
the empirical measures similar to inequalities (1) and even (2). But above all it gives
concentration estimates at the level of the paths. In return we impose some stronger
condition on the required size of the sample (however, by (6), one can also get less precise
estimates valid for any number of particles).

Assume for instance that one is interested in the behaviour of a point $Y_t$ evolving
according to (10). Then, from (12), one can derive error bounds in the approximation
by $\frac{1}{N} \sum_{i=1}^N \varphi(X^i)$ of the expectation of quantities $\varphi(Y)$ which depend on the whole path,
such as the distance $d(Y,A) = \inf\{ |Y_t - y|;\ t \in [0,T],\ y \in A \}$ of the trajectory to a
given set $A$ in $\mathbb{R}^d$, which measures how close $Y_t$ has been to $A$, or the maximal distance
$\sup\{ |Y_t - x|;\ t \in [0,T] \}$ to a given point $x$ in the phase space $\mathbb{R}^d$: for instance, under
the assumptions of Theorem 2, for any $T > 0$ and $\alpha \in (0,1/2)$ there exist some positive
constants $K$ and $N_0$ such that

$$P\Big[ \Big| E[d(Y,A)] - \frac{1}{N} \sum_{i=1}^N d(X^i,A) \Big| > \varepsilon \Big] \le e^{-K N \varepsilon^2}$$

for any Borel set $A$ in $\mathbb{R}^d$, $\varepsilon > 0$ and $N \ge N_0\, \varepsilon^{-2} \exp\big( N_0\, \varepsilon^{-1/\alpha} \big)$.
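
The path functionals mentioned above are easy to evaluate on discretized trajectories. The following is a minimal sketch, assuming real-valued paths sampled on a time grid and a set $A$ given by a closed interval $[\text{low}, \text{high}]$; the function names are ours, and restricting the infimum and supremum to grid times is an additional approximation.

```python
def distance_to_interval(path, low, high):
    """d(Y, A) = inf{|Y_t - y|; t on the grid, y in A} for A = [low, high]."""
    # The distance from a point v to [low, high] is max(low - v, v - high, 0).
    return min(max(low - v, v - high, 0.0) for v in path)


def max_distance_to_point(path, x):
    """sup{|Y_t - x|; t on the grid}."""
    return max(abs(v - x) for v in path)


def empirical_mean(functional, paths):
    """(1/N) * sum_i functional(X^i): Monte Carlo estimate of E[functional(Y)]."""
    return sum(functional(p) for p in paths) / len(paths)


if __name__ == "__main__":
    sample_paths = [[0.0, 0.5, 3.0], [1.5, 1.2, 0.9]]
    print(empirical_mean(lambda p: distance_to_interval(p, 1.0, 2.0), sample_paths))
```

Both functionals are 1-Lipschitz for the uniform norm on paths, which is what allows them to be controlled through the $W_1$ distance.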

Theorem 2 will be proven in detail in Sections 3 and 4 along the following lines. Following
[18] we proceed by coupling, by introducing a family of $N$ identically distributed processes
$Y^i = (Y_t^i)_{0 \le t \le T}$ solution to the (nonlinear) stochastic differential equations

$$dY_t^i = \sqrt{2}\, dB_t^i - \nabla V(Y_t^i)\, dt - \nabla W * \mu_t(Y_t^i)\, dt, \qquad Y_0^i = X_0^i, \qquad 1 \le i \le N;$$

here $\mu_t$ is the solution at time $t$ to (9), but is also the law on $\mathbb{R}^d$ of any $Y_t^i$ by Itô's formula,
and, for each $i$, $B^i = (B_t^i)_{0 \le t \le T}$ is the Brownian motion driving the evolution of $X^i$. In
particular the paths $Y^i$ are close to the paths $X^i$ and one can prove that there exists some
constant $C$ (depending only on $T$) such that

$$W_{1,[0,T]}(\mu_{[0,T]}, \hat\mu^N_{[0,T]}) \le C\, W_{1,[0,T]}(\mu_{[0,T]}, \hat\nu^N_{[0,T]})$$

holds almost surely, where $\hat\nu^N_{[0,T]} := \frac{1}{N} \sum_{i=1}^N \delta_{Y^i}$; hence controlling the distance between $\mu_{[0,T]}$
and $\hat\mu^N_{[0,T]}$ reduces to the same issue with $\mu_{[0,T]}$ and $\hat\nu^N_{[0,T]}$.

But, by definition, the $N$ processes $Y^i$ for $1 \le i \le N$ are independent and distributed
according to $\mu_{[0,T]}$. Then Theorem 1 ensures good concentration estimates for the empirical
measure $\hat\nu^N_{[0,T]}$ around the common law $\mu_{[0,T]}$. In the end we obtain the bound

$$P\big[ W_{1,[0,T]}(\mu_{[0,T]}, \hat\mu^N_{[0,T]}) > \varepsilon \big] \le P\big[ W_{1,[0,T]}(\mu_{[0,T]}, \hat\nu^N_{[0,T]}) > \varepsilon / C \big] \le e^{-K N \varepsilon^2}$$

under some condition on $\varepsilon$ and $N$.

The proof is actually an adaptation of the argument given in [6, Section 1.6] of estimates
(2) for time-marginals. The current proof turns out to be simpler in the sense that it
consists of fewer steps; in return each of these steps is somehow more delicate: for instance,
as we shall see in the following sections, the proof of Theorem 1 requires the computation
of the metric entropy of some space of Hölder-continuous functions, and checking that the
law of $Y$ fulfills the assumptions of this theorem needs some strong integrability in Hölder
norm on solutions to stochastic differential equations.

An adaptation of this proof leads to quantitative estimates on the phenomenon of
propagation of chaos. For instance, letting

$$\hat\mu^{N,2}_{[0,T]} := \frac{1}{N(N-1)} \sum_{i \ne j} \delta_{(X^i, X^j)}$$

be the empirical measure on pairs of paths, the asymptotic independence of two paths
(among $N$) can be estimated as in

Theorem 3. With the same notation and assumptions as in Theorem 2, for all $T > 0$ and
$\alpha \in (0,1/2)$ there exist some positive constants $K$ and $N_0$ such that

$$P\big[ W_{1,[0,T]}(\mu_{[0,T]} \otimes \mu_{[0,T]}, \hat\mu^{N,2}_{[0,T]}) > \varepsilon \big] \le e^{-K N \varepsilon^2}$$

for all $\varepsilon > 0$ and $N \ge N_0\, \varepsilon^{-2} \exp\big( N_0\, \varepsilon^{-1/\alpha} \big)$.

Here the constants $K$ and $N_0$ depend on $T$, $V$, $W$, $\alpha$ and a finite square-exponential
moment of $\mu_0$, and $W_{1,[0,T]}$ stands for the Wasserstein distance of order 1 on the product
space $C \times C$. The proof consists in writing the coupling argument for pairs of paths and
comparing $\mu_{[0,T]} \otimes \mu_{[0,T]}$ and $\hat\nu^{N,2}_{[0,T]} := \frac{1}{N(N-1)} \sum_{i \ne j} \delta_{(Y^i, Y^j)}$ by means of [?].

Let us finally note that it would be desirable to relax the assumptions made on the
potentials $V$ and $W$, in particular so as to include the interesting case of the cubic potential
$W(z) = |z|^3/3$ on $\mathbb{R}$, which models the interaction among one-dimensional granular media
(see [3]). It could also be interesting to consider the whole trajectories $(X_t)_{t \ge 0}$, and derive
concentration bounds on functionals such as hitting times for instance.

Before turning to the proofs we briefly recall the plan of the paper. In the coming
section we prove Theorem 1 for general $C$-valued independent variables. The study of the
particle system is addressed in the following two sections: in Section 3 we reduce our
concentration issue on interacting particles to the same issue for independent variables by a
coupling argument, whereas in Section 4 we check that we can apply our general
concentration result to these independent variables; with this in hand we can prove Theorem 2. An
appendix is devoted to a general metric entropy estimate in a space of Hölder-continuous
functions, which enters the proof of Theorem 1.



2. A PRELIMINARY RESULT ON INDEPENDENT VARIABLES 

The aim of this section is to prove Theorem 1 for independent and identically
distributed random variables valued in $C$. We have seen how this result, applied to the
artificial processes $(Y_t^i)_{0 \le t \le T}$, enters the study of our interacting particle system.

The proof goes in three steps: truncation to a ball $B_R^\alpha$ of $C^\alpha$, compact for the topology
induced by the uniform norm; covering of $B_R^\alpha$ and then of $P(B_R^\alpha)$ by small balls on which
one develops Sanov's argument; conclusion of the proof by optimizing the introduced
parameters. Since the argument follows the lines of the proof given in [6, Section 2.1] in the
finite dimensional case, in which $\mu$ is a measure on $\mathbb{R}^d$, we shall only sketch it, stressing
only the bounds specific to our new framework. We refer to [6] for further details.

Step 1. Truncation. Given $R > 0$, to be chosen later on, we denote by $B_R^\alpha$ the ball
$\{ f \in C^\alpha;\ \|f\|_\alpha \le R \}$ of center 0 and radius $R$ in $C^\alpha$. This set $B_R^\alpha$ is a compact subset of $C$
for the topology induced by the uniform norm $\|\cdot\|_\infty$: indeed it is relatively compact in $C$
by Ascoli's theorem, and closed since if $f$ in $C$ is the uniform limit of a sequence $(f_n)_n$ in
$C^\alpha$, then $\|f\|_\alpha \le \liminf_n \|f_n\|_\alpha$, and in particular $f$ belongs to $B_R^\alpha$ if so do the $f_n$.

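Letting $\mathbf{1}_{B_R^\alpha}$ be the indicator function of $B_R^\alpha$, we truncate $\mu$ into a probability measure
$\mu_R$ on the ball $B_R^\alpha$ defined as

$$\mu_R := \frac{\mathbf{1}_{B_R^\alpha}\, \mu}{\mu[B_R^\alpha]}.$$

Note that $\mu[B_R^\alpha]$ is positive for $R$ larger than some $R_0$ depending only on $a$ and
$E_a := \int_C e^{a \|x\|_\alpha^2} \, d\mu(x)$. In this step we reduce the concentration problem for $C$ to the same issue
for the compact ball $B_R^\alpha$, by bounding the quantity $P\big[ W_{p,[0,T]}(\mu, \hat\mu^N) > \varepsilon \big]$ in terms of $\mu_R$ and
an associated empirical measure $\hat\mu_R^N := \frac{1}{N} \sum_{k=1}^N \delta_{X_R^k}$ where the $X_R^k$ are independent variables
with law $\mu_R$.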

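Bounding from above the $\|\cdot\|_\infty$ norms by $\|\cdot\|_\alpha$ norms when necessary, we proceed exactly
as in [6, proof of Theorem 1.1] to obtain the bound

$$P\big[ W_{p,[0,T]}(\mu, \hat\mu^N) > \varepsilon \big] \le P\Big[ W_{p,[0,T]}(\mu_R, \hat\mu_R^N) > \eta \varepsilon - 2 E_a^{1/p} R\, e^{-\frac{a}{p} R^2} \Big]
+ \exp\Big( - N \big( \theta (1-\eta)^p \varepsilon^p - E_a\, e^{(a_1 - a) R^2} \big) \Big); \tag{13}$$

here $p$ is any real number in $[1,2)$, $\eta$ in $(0,1)$, $\varepsilon, \theta > 0$, $a_1 < a$ and $R$ is constrained to be
larger than $R_2 \max(1, \theta^{1/(2-p)})$ for some constant $R_2$ depending only on $E_a$, $a$, $a_1$ and $p$.

In the case when $p = 2$, we obtain

$$P\big[ W_{2,[0,T]}(\mu, \hat\mu^N) > \varepsilon \big] \le P\Big[ W_{2,[0,T]}(\mu_R, \hat\mu_R^N) > \eta \varepsilon - 2 E_a^{1/2} R\, e^{-\frac{a}{2} R^2} \Big]
+ \exp\Big( - N \Big( \frac{a_1}{2} (1-\eta)^2 \varepsilon^2 - 2 E_a^2\, e^{(a_1 - a) R^2} \Big) \Big). \tag{14}$$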



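Step 2. Sanov's argument on small balls. In view of (13) for $p < 2$ or (14) for
$p = 2$, we now aim at bounding $P[\hat\mu_R^N \in A]$ where

$$A := \Big\{ \nu \in P(B_R^\alpha);\ W_{p,[0,T]}(\nu, \mu_R) \ge \eta \varepsilon - 2 E_a^{1/p} R\, e^{-\frac{a}{p} R^2} \Big\}.$$

For that purpose, reasoning as in [6], we let $\delta > 0$ and cover $A$ with $\mathcal{N}(A,\delta)$ balls
$(B_i)_{1 \le i \le \mathcal{N}(A,\delta)}$ with radius $\delta/2$ in $W_{p,[0,T]}$ distance. Then one can develop Sanov's
argument on each of these compact and convex balls, and obtain the bound

$$P[\hat\mu_R^N \in A] \le P\Big[ \hat\mu_R^N \in \bigcup_{i=1}^{\mathcal{N}(A,\delta)} B_i \Big] \le \sum_{i=1}^{\mathcal{N}(A,\delta)} P[\hat\mu_R^N \in B_i] \le \sum_{i=1}^{\mathcal{N}(A,\delta)} \exp\Big( - N \inf_{\nu \in B_i} H(\nu|\mu_R) \Big). \tag{15}$$

Then, from the $T_p(\lambda)$ inequality for $\mu$, one establishes an approximate $T_p(\lambda)$ inequality
for $\mu_R$: namely, for any $\lambda_1 < \lambda$ there exists $K_1$ such that

$$H(\nu|\mu_R) \ge \frac{\lambda_1}{2} W_{p,[0,T]}(\nu, \mu_R)^2 - K_1 R^2 e^{-a R^2}$$

for any measure $\nu$ on $B_R^\alpha$. With this inequality in hand, given $1 \le p < 2$ and $\lambda_2 < \lambda_1 < \lambda$,
one deduces from (15) the existence of some positive constants $\delta_1$, $\eta_1$ and $K_1$ such that

$$P\Big[ W_{p,[0,T]}(\mu_R, \hat\mu_R^N) > \eta \varepsilon - 2 E_a^{1/p} R\, e^{-\frac{a}{p} R^2} \Big] \le \mathcal{N}(A,\delta) \exp\Big( - N \Big( \frac{\lambda_2}{2} \varepsilon^2 - K_1 R^2 e^{-a R^2} \Big) \Big) \tag{16}$$

where we have chosen $\delta := \delta_1 \varepsilon$ and $\eta := \eta_1$.

In the case when $p = 2$, we do not choose $\eta$ at this stage, and simply obtain

$$P\Big[ W_{2,[0,T]}(\mu_R, \hat\mu_R^N) > \eta \varepsilon - 2 E_a^{1/2} R\, e^{-\frac{a}{2} R^2} \Big] \le \mathcal{N}(A,\delta) \exp\Big( - N \Big( \frac{\lambda_2}{2} \eta^2 \varepsilon^2 - K_1 R^2 e^{-a R^2} \Big) \Big)$$

where $\delta := \delta_1 \varepsilon$.

Then, since $A$ is a subset of $P(B_R^\alpha)$, Theorem 10 in the Appendix allows us to bound
$\mathcal{N}(A,\delta)$ with $\delta = \delta_1 \varepsilon$ by

$$\exp\Big( K_2 (R \varepsilon^{-1})^d\, 3^{K_2 (R \varepsilon^{-1})^{1/\alpha}} \ln\big( \max(1, K_2 R \varepsilon^{-1}) \big) \Big) \tag{17}$$

for some constant $K_2$ depending neither on $\varepsilon$ nor on $R$.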

Remark 4. The order of magnitude of this covering number in an infinite-dimensional
setting constitutes a main change by comparison with the finite-dimensional setting of [6],
and will influence the final condition on the size $N$ of the sample.
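
Step 3. Conclusion of the argument. We first focus on the case when $p \in [1,2)$.
Collecting estimates (13), (16) and (17), we obtain, given $\lambda_2 < \lambda$ and $a_1 < a$, the existence
of positive constants $K_1$, $K_2$, $K_3$ and $R_3$ depending on $E_a$, $a$, $a_1$, $\alpha$, $\lambda$ and $\lambda_2$ such that

$$P\big[ W_{p,[0,T]}(\mu, \hat\mu^N) > \varepsilon \big]
\le \exp\Big( K_2 (R \varepsilon^{-1})^d\, 3^{K_2 (R \varepsilon^{-1})^{1/\alpha}} \ln\big( \max(1, K_2 R \varepsilon^{-1}) \big) - N \Big( \frac{\lambda_2}{2} \varepsilon^2 - K_1 R^2 e^{-a R^2} \Big) \Big)
+ \exp\Big( - N \big( K_3 \theta \varepsilon^p - K_4\, e^{(a_1 - a) R^2} \big) \Big) \tag{18}$$

for all $\varepsilon, \theta > 0$ and $R \ge R_3 \max(1, \theta^{1/(2-p)})$, and for some constant $K_4 = K_4(\theta, a_1)$.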

Then let $\lambda_3 < \lambda_2$. One can prove that the first term in the right-hand side in (18) is
bounded by $\exp\big( - \frac{\lambda_3}{2} N \varepsilon^2 \big)$ provided



R' > Amax(l,£2,ln(£-2)) , Ne' > E3^(^^ ^ (19) 

A3 

for some positive constants A,B and C depending also on A3. Moreover, for 9 = , 

A 

also the second term in the right-hand side in ()18|1 is bounded by exp ^ N as soon 

as R^ > i?4 max(l, ln(£:~^)), for some constant R4 depending on A3. 

/ 1 A^e^x" ^ 
Letting R = e[ — — — - In — — ) if e G (0, 1) and R = V Ae otherwise, and a' < a, both 
VC In 3 i? / 

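conditions in (19) hold true as soon as $N \ge N_0\, \varepsilon^{-2} \exp\big( N_0\, \varepsilon^{-1/\alpha'} \big)$ for some constant $N_0$
depending on $E_a$, $a$, $\lambda$, $\lambda_3$, $\alpha$ and $\alpha'$. Finally, given $\lambda' < \lambda_3 < \lambda$, this condition ensures that

$$P\big[ W_{p,[0,T]}(\mu, \hat\mu^N) > \varepsilon \big] \le 2 \exp\Big( - \frac{\lambda_3}{2} N \varepsilon^2 \Big) \le \exp\Big( - \frac{\lambda'}{2} N \varepsilon^2 \Big),$$

possibly for some larger $N_0$. This concludes the argument in the case when $p \in [1,2)$.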

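In the case when $p = 2$, given $0 < \eta < 1$, $\lambda_3 < \lambda_2$ and $a_2 < a_1$, the same condition on $N$
and $\varepsilon$ (for some $N_0$) is sufficient for the bound

$$P\big[ W_{2,[0,T]}(\mu, \hat\mu^N) > \varepsilon \big] \le \exp\Big( - \frac{\lambda_3}{2} \eta^2 N \varepsilon^2 \Big) + \exp\Big( - \frac{a_2}{2} (1-\eta)^2 N \varepsilon^2 \Big)$$

to hold (by (14)). One optimizes this bound by letting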



02 = a — ^ (g [0, a)) and r] 



^ 1/02 + vAs 

Given A' < A3 < A, this ensures the existence of Nq such that 

> .] < 2 exp ( - ^^^^iV.^) < ( - ^(rTTW^" 

for any $\varepsilon > 0$ and $N \ge N_0\, \varepsilon^{-2} \exp\big( N_0\, \varepsilon^{-1/\alpha'} \big)$. This concludes the proof of Theorem 1 in
this second and last case.

3. Coupling argument

Here begins the proof of Theorem 2 on the behaviour of our large interacting particle
system.

We recall that we are given $N$ independent variables $X_0^i$ in $\mathbb{R}^d$, with common law $\mu_0$,
and $N$ independent Brownian motions $B^i = (B_t^i)_{0 \le t \le T}$ in $\mathbb{R}^d$, and we consider the solutions
$X^i = (X_t^i)_{0 \le t \le T}$ to the coupled stochastic differential equations

$$dX_t^i = \sqrt{2}\, dB_t^i - \nabla V(X_t^i)\, dt - \frac{1}{N} \sum_{j=1}^N \nabla W(X_t^i - X_t^j)\, dt, \qquad 1 \le i \le N.$$

We also let $\mu_{[0,T]}$ be the law of the process $Y = (Y_t)_{0 \le t \le T}$ defined by

$$dY_t = \sqrt{2}\, dB_t - \nabla V(Y_t)\, dt - \nabla W * \mu_t(Y_t)\, dt$$

and starting at some $Y_0$ drawn according to $\mu_0$; here $B = (B_t)_{0 \le t \le T}$ also is a Brownian
motion and $\mu_t$ is the law of $Y_t$, that is, the time-marginal of $\mu_{[0,T]}$ at time $t$.
We want to compare this law $\mu_{[0,T]}$ and the empirical measure of the paths

$$\hat\mu^N_{[0,T]} := \frac{1}{N} \sum_{i=1}^N \delta_{X^i}.$$

For this purpose we introduce $N$ independent processes $Y^i = (Y_t^i)_{0 \le t \le T}$ defined by

$$dY_t^i = \sqrt{2}\, dB_t^i - \nabla V(Y_t^i)\, dt - \nabla W * \mu_t(Y_t^i)\, dt, \qquad 1 \le i \le N \tag{20}$$

for the same Brownian motions $B^i$, and such that $Y_0^i = X_0^i$ initially. We let

$$\hat\nu^N_{[0,T]} := \frac{1}{N} \sum_{i=1}^N \delta_{Y^i} \tag{21}$$

and in this section we reduce the issue to measuring the distance between $\mu_{[0,T]}$ and $\hat\nu^N_{[0,T]}$.

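Proposition 5. In the above notation and under the assumptions

$$\beta I \le D^2 V(x), \qquad \gamma I \le D^2 W(x) \le \gamma' I, \qquad x \in \mathbb{R}^d$$

on $V$ and $W$, where $\beta$, $\gamma$ and $\gamma'$ are real numbers, for any $T > 0$ there exists some constant
$C$ depending only on $\beta$, $\gamma$, $\gamma'$ and $T$ such that

$$W_{1,[0,T]}(\mu_{[0,T]}, \hat\mu^N_{[0,T]}) \le C\, W_{1,[0,T]}(\mu_{[0,T]}, \hat\nu^N_{[0,T]})$$

almost surely.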

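Proof. We first follow the lines of the proof of [6, Proposition 5.1], but in the end we want
an estimate on the trajectories. Since for each $i$ both processes $X^i$ and $Y^i$ are driven by
the same Brownian motion $B^i$, the process $X^i - Y^i$ satisfies

$$d(X_t^i - Y_t^i) = - \big( \nabla V(X_t^i) - \nabla V(Y_t^i) \big)\, dt - \big( \nabla W * \hat\mu_t^N(X_t^i) - \nabla W * \mu_t(Y_t^i) \big)\, dt.$$

In particular, letting $u \cdot v$ denote the scalar product of two vectors $u$ and $v$ in $\mathbb{R}^d$,

$$\frac{1}{2} \frac{d}{dt} |X_t^i - Y_t^i|^2 = - \big( \nabla V(X_t^i) - \nabla V(Y_t^i) \big) \cdot (X_t^i - Y_t^i)
- \big( \nabla W * \hat\mu_t^N(X_t^i) - \nabla W * \mu_t(Y_t^i) \big) \cdot (X_t^i - Y_t^i). \tag{22}$$

We decompose the last term according to

$$\nabla W * \hat\mu_t^N(X_t^i) - \nabla W * \mu_t(Y_t^i) = \big( \nabla W * \hat\mu_t^N - \nabla W * \mu_t \big)(X_t^i) + \big( \nabla W * \mu_t(X_t^i) - \nabla W * \mu_t(Y_t^i) \big).$$

By our assumption on $D^2 W$, the map $\nabla W(X_t^i - \,\cdot\,)$ is $\Gamma$-Lipschitz with $\Gamma := \max(|\gamma|, |\gamma'|)$.
Consequently, by the Kantorovich-Rubinstein dual formulation (5) of $W_{1,\tau}$,

$$\big| \nabla W * (\hat\mu_t^N - \mu_t)(X_t^i) \big| = \Big| \int_{\mathbb{R}^d} \nabla W(X_t^i - y) \, d(\hat\mu_t^N - \mu_t)(y) \Big| \le \Gamma\, W_{1,\tau}(\hat\mu_t^N, \mu_t).$$

Then (22) and our convexity assumptions on $V$ and $W$ imply

$$\frac{1}{2} \frac{d}{dt} |X_t^i - Y_t^i|^2 \le - (\beta + \gamma)\, |X_t^i - Y_t^i|^2 + \Gamma\, W_{1,\tau}(\hat\mu_t^N, \mu_t)\, |X_t^i - Y_t^i|.$$

In particular, by Gronwall's lemma

$$|X_t^i - Y_t^i| \le \Gamma \int_0^t e^{-(\beta+\gamma)(t-u)}\, W_{1,\tau}(\hat\mu_u^N, \mu_u)\, du \tag{23}$$

since initially $X_0^i = Y_0^i$. Consequently, by convexity of the $W_{1,[0,t]}$ distance,

$$W_{1,[0,t]}(\hat\mu^N_{[0,t]}, \hat\nu^N_{[0,t]}) \le \frac{1}{N} \sum_{i=1}^N \sup_{0 \le s \le t} |X_s^i - Y_s^i| \le \sup_{0 \le s \le t} \Gamma \int_0^s e^{-(\beta+\gamma)(s-u)}\, W_{1,\tau}(\hat\mu_u^N, \mu_u)\, du \tag{24}$$

for all $0 \le t \le T$. But

$$W_{1,\tau}(\hat\mu_u^N, \mu_u) \le W_{1,[0,u]}(\hat\mu^N_{[0,u]}, \mu_{[0,u]}) \le W_{1,[0,u]}(\hat\mu^N_{[0,u]}, \hat\nu^N_{[0,u]}) + W_{1,[0,u]}(\hat\nu^N_{[0,u]}, \mu_{[0,u]})$$

by the projection relation (3) and triangular inequality for $W_{1,[0,u]}$, so

$$W_{1,[0,t]}(\hat\mu^N_{[0,t]}, \hat\nu^N_{[0,t]}) \le \Gamma e^{|\beta+\gamma| T} \int_0^t \exp\big( \Gamma e^{|\beta+\gamma| T} (t-u) \big)\, W_{1,[0,u]}(\hat\nu^N_{[0,u]}, \mu_{[0,u]})\, du$$

for all $0 \le t \le T$ by Gronwall's lemma again. Then, given $0 \le u \le t$,

$$W_{1,[0,u]}(\hat\nu^N_{[0,u]}, \mu_{[0,u]}) \le W_{1,[0,t]}(\hat\nu^N_{[0,t]}, \mu_{[0,t]})$$

since $\hat\nu^N_{[0,u]}$ and $\mu_{[0,u]}$ are the respective image measures of $\hat\nu^N_{[0,t]}$ and $\mu_{[0,t]}$ by the 1-Lipschitz
map defined from $C([0,t],\mathbb{R}^d)$ into $C([0,u],\mathbb{R}^d)$ as the restriction to $[0,u]$. Hence

$$W_{1,[0,t]}(\hat\mu^N_{[0,t]}, \hat\nu^N_{[0,t]}) \le C\, W_{1,[0,t]}(\hat\nu^N_{[0,t]}, \mu_{[0,t]})$$

by the previous two bounds, for some constant $C$ depending only on $T$, $\beta$, $\gamma$ and $\gamma'$. This concludes the argument
by triangular inequality. □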
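
The coupling used in this proof can be observed numerically in a special case. The following is a minimal sketch under strong assumptions that are ours and not the paper's: dimension one, the quadratic potentials $V(x) = x^2/2$ and $W(z) = z^2/2$, a standard Gaussian initial law, and an Euler-Maruyama time discretization. With these choices $\mu_t$ stays centered, so $\nabla W * \mu_t(y) = y$ and the nonlinear processes $Y^i$ can be simulated directly. The function name `coupled_gap` is hypothetical.

```python
import math
import random


def coupled_gap(n, T, n_steps, rng):
    """Average over i of sup_t |X_t^i - Y_t^i| for particles X^i and their
    independent copies Y^i driven by the same Brownian increments.

    Assumptions (illustration only): V(x) = x^2/2, W(z) = z^2/2, centered
    Gaussian initial law, hence grad W * mu_t (y) = y.
    """
    dt = T / n_steps
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    y = list(x)  # same initial values: Y_0^i = X_0^i
    max_gap = [0.0] * n
    for _ in range(n_steps):
        mean_x = sum(x) / n  # empirical mean, computed before the update
        for i in range(n):
            noise = math.sqrt(2.0 * dt) * rng.gauss(0.0, 1.0)  # shared by X^i and Y^i
            # Drift of X^i: -V'(x) - (x - empirical mean).
            x[i] += noise - (x[i] + (x[i] - mean_x)) * dt
            # Drift of Y^i: -V'(y) - (y - 0), since mu_t is centered.
            y[i] += noise - (y[i] + y[i]) * dt
            max_gap[i] = max(max_gap[i], abs(x[i] - y[i]))
    return sum(max_gap) / n


if __name__ == "__main__":
    for n in (10, 100, 1000):
        print(n, coupled_gap(n, T=1.0, n_steps=100, rng=random.Random(1)))
```

The returned average bounds from above the $W_1$ distance on path space between the two (discretized) empirical measures, since pairing $X^i$ with $Y^i$ is one admissible coupling; this is the convexity step used in the proof.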



Remark 6. If moreover $\beta + \gamma > \Gamma$, where again $\Gamma := \max(|\gamma|, |\gamma'|)$, then we can let $C$ be
$(\beta + \gamma)(\beta + \gamma - \Gamma)^{-1}$ in Proposition 5, independently of $T$. Indeed, if $\beta + \gamma > 0$, then (24)
leads to

$$W_{1,[0,t]}(\hat\mu^N_{[0,t]}, \hat\nu^N_{[0,t]}) \le \Gamma \sup_{0 \le s \le t} \int_0^s e^{-(\beta+\gamma)(s-u)}\, du \ \sup_{0 \le u \le t} W_{1,\tau}(\hat\mu_u^N, \mu_u) \le \frac{\Gamma}{\beta + \gamma}\, W_{1,[0,t]}(\hat\mu^N_{[0,t]}, \mu_{[0,t]})$$

and by triangular inequality

$$W_{1,[0,t]}(\hat\mu^N_{[0,t]}, \mu_{[0,t]}) \le \frac{\beta + \gamma}{\beta + \gamma - \Gamma}\, W_{1,[0,t]}(\hat\nu^N_{[0,t]}, \mu_{[0,t]})$$

provided $\beta + \gamma > \Gamma$.

This is reminiscent of the fact that, under some convexity assumptions on $V$ and $W$,
such as $\beta > 0$, $\beta + 2\gamma > 0$, it has been proven in [?] that the time-marginal $\mu_t$ of
the measure $\mu_{[0,t]}$ converges, as $t$ goes to infinity, to the stationary solution to the limit
equation (9). One can also prove in this context that (in expectation) observables of the
particle system are bounded in time.

Hence, under this kind of assumptions, one could hope for some uniform in time constants
in this coupling argument: that was obtained in [6, Proposition 5.1] for the time-marginals,
and here for the whole processes. However, contrary to [6] where this property was used
to approach the stationary solution by coupling together estimates of concentration of the
empirical measure (as $N$ goes to infinity) with estimates of convergence to equilibrium (as
$t$ goes to infinity), in this work we are concerned with finite time intervals only, and shall
not use this specific property in the sequel.



4. Integrability in Hölder norm

In the previous section we have reduced the issue of measuring the distance between $\mu_{[0,T]}$ and $\hat\mu^N_{[0,T]}$ to measuring the distance between $\mu_{[0,T]}$ and the empirical measure $\hat\nu^N_{[0,T]}$ of $N$ independent random variables drawn according to $\mu_{[0,T]}$.

We now solve the latter issue by proving that the measure $\mu_{[0,T]}$ fulfills the hypotheses of the concentration theorem for independent paths with $p = 1$, namely, that there exist $\alpha \in (0,1]$ and $a > 0$ such that

$$\int_{\mathcal{C}} e^{a\|x\|_\alpha^2}\,d\mu_{[0,T]}(x) := \mathbb{E}\exp\big(a\|Y\|_\alpha^2\big) < +\infty.$$

Here again $\mathcal{C}$ stands for $C([0,T],\mathbb{R}^d)$, $\|f\|_\alpha$ for the Hölder norm of a function $f$ on $[0,T]$, and $Y = (Y_t)_{0\le t\le T}$ is the solution to the stochastic differential equation

$$dY_t = \sqrt{2}\,dB_t - \nabla V(Y_t)\,dt - \nabla W * \mu_t(Y_t)\,dt \qquad (25)$$



starting at $Y_0$ drawn according to $\mu_0$, where $\mu_t$ is the law of $Y_t$.

Proposition 7. Let $\mu_0$ be a probability measure on $\mathbb{R}^d$ admitting a finite square-exponential moment and let $Y_0$ be drawn according to $\mu_0$. Given $T > 0$, and $V$ and $W$ satisfying the standing hypotheses, let $Y$ be the solution to (25) starting at $Y_0$. Then, for any $\alpha \in (0,1/2)$, there exists $a > 0$, depending on $\mu_0$ only through a finite square-exponential moment, such that $\mathbb{E}\exp(a\|Y\|_\alpha^2)$ is finite.
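As an illustration of equation (25), the following minimal sketch simulates an interacting particle system by an Euler-Maruyama scheme. It rests on assumptions that are not taken from the text: the quadratic choices $V(x) = W(x) = |x|^2/2$, a standard Gaussian initial law, the usual replacement of $\mu_t$ by the empirical measure of the particles, and the hypothetical function name `simulate_particles`.

```python
import numpy as np

def simulate_particles(n_particles, n_steps, T, d=1, seed=0):
    """Euler-Maruyama paths for dX = sqrt(2) dB - grad V(X) dt - grad W * mu_hat(X) dt.

    Illustrative assumptions: V(x) = W(x) = |x|^2 / 2 and a standard Gaussian
    initial law; mu_hat is the empirical measure of the particles.
    """
    rng = np.random.default_rng(seed)
    dt = T / n_steps
    paths = np.empty((n_steps + 1, n_particles, d))
    paths[0] = rng.normal(size=(n_particles, d))
    for k in range(n_steps):
        x = paths[k]
        grad_v = x                          # grad V(x) = x
        grad_w_conv = x - x.mean(axis=0)    # (1/N) sum_j grad W(x_i - x_j) = x_i - mean
        noise = rng.normal(size=x.shape) * np.sqrt(2.0 * dt)  # increment of sqrt(2) B
        paths[k + 1] = x - (grad_v + grad_w_conv) * dt + noise
    return paths
```

The returned array has shape `(n_steps + 1, n_particles, d)`, so that `paths[:, i, :]` is the discretized path of particle `i` on $[0,T]$.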

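Assuming this result for the moment we can now conclude the proof of the main theorem. Let indeed $\alpha$ be given in $(0,1/2)$, and $\alpha_0 \in (\alpha,1/2)$. Then, by Proposition 7 and the concentration theorem for independent paths applied with $\alpha = \alpha_0$ and $\alpha' = \alpha$, there exist some constants $K$ and $N_0$, depending on $\alpha_0$, $\alpha$, $T$ and a square-exponential moment of $\mu_0$, such that

$$\mathbb{P}\big[W_{1,[0,T]}(\mu_{[0,T]},\hat\nu^N_{[0,T]}) > \varepsilon\big] \le e^{-KN\varepsilon^2}$$

for any $\varepsilon > 0$ and $N \ge N_0\,\varepsilon^{-2}\exp(N_0\,\varepsilon^{-1/\alpha})$, where $\hat\nu^N_{[0,T]}$ is the empirical measure of $N$ independent paths drawn according to $\mu_{[0,T]}$. Then, by the coupling proposition of the previous section, there exist some constant $C$, depending only on $T$, and then $K$ and $N_0$, depending on $\alpha_0$, $\alpha$, $T$ and a finite square-exponential moment of $\mu_0$, such that

$$\mathbb{P}\big[W_{1,[0,T]}(\mu_{[0,T]},\hat\mu^N_{[0,T]}) > \varepsilon\big] \le \mathbb{P}\big[W_{1,[0,T]}(\mu_{[0,T]},\hat\nu^N_{[0,T]}) > \varepsilon/C\big] \le e^{-KN\varepsilon^2}$$

for any $\varepsilon > 0$ and $N \ge N_0\,\varepsilon^{-2}\exp(N_0\,\varepsilon^{-1/\alpha})$. This concludes the argument. □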



Proof of Proposition 7. It is necessary and sufficient to prove that there exist positive constants $a_1$ and $a_2$ such that $\mathbb{E}\exp(a_1\|Y\|_\infty^2)$ and $\mathbb{E}\exp(a_2[Y]_\alpha^2)$ are finite, where $[\cdot]_\alpha$ stands for the Hölder seminorm defined in Section 1.2.

Step 1. We start with the expectation in uniform norm. For this we first note that, according to a known moment estimate (Proposition 3.1 in the cited reference), there exist some positive constants $M$ and $a$, depending on $\mu_0$ only through a finite square-exponential moment, such that $\sup_{0\le t\le T}\mathbb{E}|Y_t|^2$ and $\sup_{0\le t\le T}\mathbb{E}\exp(a|Y_t|^2)$ are finite and bounded by $M$.

Then we let $b$ be some smooth function on $[0,T]$, to be chosen later on, and we let $Z_t = \exp(b(t)\,|Y_t|^2)$. We want to prove that $\mathbb{E}\sup_{0\le t\le T} Z_t$ is finite for some positive function $b$. By Itô's formula,

$$Z_t = Z_0 + M_t + \int_0^t \Big[b'(s)|Y_s|^2 + 2d\,b(s) + 4\,b(s)^2|Y_s|^2 - 2\,b(s)\,Y_s\cdot\big(\nabla V(Y_s) + \nabla W*\mu_s(Y_s)\big)\Big] Z_s\,ds$$

where $(M_t)_{0\le t\le T}$ is the martingale defined as

$$M_t = 2\sqrt{2}\int_0^t b(s)\,Z_s\,Y_s\cdot dB_s.$$

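But $D^2V(x) \ge \beta I$ for all $x \in \mathbb{R}^d$, so for any $\delta > 0$ and $y \in \mathbb{R}^d$ we have

$$-y\cdot\nabla V(y) \le (\delta-\beta)\,|y|^2 + \frac{|\nabla V(0)|^2}{4\delta}.$$

Furthermore $\nabla W$ is $\Gamma$-Lipschitz and $\nabla W(0) = 0$, so

$$-2\,y\cdot\nabla W*\mu_s(y) \le 2\Gamma\int |y|\,|y-z|\,d\mu_s(z) \le 3\Gamma\,|y|^2 + \Gamma\int |z|^2\,d\mu_s(z).$$

But $\int |z|^2\,d\mu_s(z) = \mathbb{E}|Y_s|^2 \le M$ on $[0,T]$, so collecting all terms together, we obtain

$$Z_t \le Z_0 + M_t + \int_0^t \big[C(s) + D(s)\,|Y_s|^2\big] Z_s\,ds$$

where

$$C(s) = \Big(2d + \Gamma M + \frac{|\nabla V(0)|^2}{2\delta}\Big)\,b(s) \quad\text{and}\quad D(s) = b'(s) + 4\,b(s)^2 + \big(2(\delta-\beta) + 3\Gamma\big)\,b(s).$$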



Given $\delta > 0$ such that $c := 2(\delta-\beta) + 3\Gamma$ is positive, we choose $b$ such that $D(s) = 0$, namely

$$b(s) = e^{-cs}\big(b(0)^{-1} + 4c^{-1}(1 - e^{-cs})\big)^{-1}$$

for some $b(0)$ to be chosen later on. In particular $b$ is a nonincreasing continuous positive function on $[0,+\infty)$, and, for this function $b$, $Z_t$ almost surely satisfies the inequality

$$Z_t \le Z_0 + M_t + C(0)\int_0^t Z_s\,ds.$$
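The choice of $b$ can be checked numerically. The sketch below evaluates the closed-form expression for $b$ and verifies, by a central finite difference, that $b' + 4b^2 + cb$ vanishes; the function names `b` and `D` and the step `h` are illustrative choices only.

```python
import math

def b(s, b0, c):
    """Closed-form solution of b' + 4 b^2 + c b = 0 with b(0) = b0 > 0 and c > 0."""
    decay = math.exp(-c * s)
    return decay / (1.0 / b0 + (4.0 / c) * (1.0 - decay))

def D(s, b0, c, h=1e-6):
    """Residual b'(s) + 4 b(s)^2 + c b(s), with b' approximated by a central difference."""
    db = (b(s + h, b0, c) - b(s - h, b0, c)) / (2.0 * h)
    return db + 4.0 * b(s, b0, c) ** 2 + c * b(s, b0, c)
```

For instance, with `b0 = 0.3` and `c = 1.5` the residual `D(s, 0.3, 1.5)` is zero up to finite-difference error for every `s` in `[0, 5]`, and `b` decreases from `b0`.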

In particular

$$\mathbb{E}\sup_{0\le t\le T} Z_t \le \mathbb{E}Z_0 + \mathbb{E}\sup_{0\le t\le T} M_t + C(0)\int_0^T \mathbb{E}Z_s\,ds. \qquad (26)$$

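But, by Cauchy-Schwarz' and Doob's inequalities,

$$\Big(\mathbb{E}\sup_{0\le t\le T} M_t\Big)^2 \le \mathbb{E}\sup_{0\le t\le T}|M_t|^2 \le 4\sup_{0\le t\le T}\mathbb{E}|M_t|^2.$$

Then, by Itô's formula again,

$$\mathbb{E}|M_t|^2 = 8\int_0^t b(s)^2\,\mathbb{E}\big[Z_s^2\,|Y_s|^2\big]\,ds \le 8\,b(0)\int_0^t \mathbb{E}\big[b(s)\,|Y_s|^2\exp\big(2\,b(s)\,|Y_s|^2\big)\big]\,ds \le 8\,b(0)\int_0^t \mathbb{E}\exp\big(3\,b(0)\,|Y_s|^2\big)\,ds.$$

Choosing $b(0) \le a/3$, this ensures that $\sup_{0\le t\le T}\mathbb{E}|M_t|^2$, whence $\mathbb{E}\sup_{0\le t\le T} M_t$, is finite.

Since, for this $b(0)$, $\sup_{0\le t\le T}\mathbb{E}Z_t$ also is finite, it follows from (26) that so is $\mathbb{E}\sup_{0\le t\le T} Z_t$, which concludes the argument for the expectation in uniform norm with $a_1 = b(T)$.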
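Step 2. We now turn to the expectation in Hölder seminorm. Writing the solution as

$$Y_t = Y_0 + \sqrt{2}\,B_t - \int_0^t \big(\nabla V(Y_s) + \nabla W*\mu_s(Y_s)\big)\,ds$$

we obtain

$$[Y]_\alpha \le \sqrt{2}\,[B]_\alpha + \Big[\int_0^\cdot \big(\nabla V(Y_s) + \nabla W*\mu_s(Y_s)\big)\,ds\Big]_\alpha$$

almost surely; here $Y$ and $B$ stand as before for the maps $t \mapsto Y_t$ and $t \mapsto B_t$ respectively, and $\int_0^\cdot \varphi(s)\,ds$ is an antiderivative of $\varphi$. Hence, since $(x+y)^2 \le 2x^2 + 2y^2$ and by Cauchy-Schwarz' inequality,

$$\mathbb{E}\exp\big(a_2[Y]_\alpha^2\big) \le \Big(\mathbb{E}\exp\big(8a_2[B]_\alpha^2\big)\Big)^{1/2}\Big(\mathbb{E}\exp\Big(4a_2\Big[\int_0^\cdot \big(\nabla V(Y_s) + \nabla W*\mu_s(Y_s)\big)\,ds\Big]_\alpha^2\Big)\Big)^{1/2}.$$

But, on one hand, $\mathbb{E}\exp(8a_2[B]_\alpha^2)$ is finite for $a_2$ small enough (see Theorem 1.3.2 of the cited reference for instance, with $E = \mathcal{C}$ and $N(f) = [f]_\alpha$). On the other hand, by assumption, $\nabla V$ and $\nabla W$ are respectively $B$- and $\Gamma$-Lipschitz with $B := \max(|\beta|,|\beta'|)$ and $\Gamma := \max(|\gamma|,|\gamma'|)$, so there exists some constant $A$ such that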

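$$|\nabla V(y) + \nabla W*\mu_s(y)| \le A + (B+\Gamma)\,|y|$$

for all $y \in \mathbb{R}^d$ and $s \in [0,T]$. In particular

$$\Big[\int_0^\cdot \big(\nabla V(Y_s) + \nabla W*\mu_s(Y_s)\big)\,ds\Big]_\alpha \le \sup_{0\le s<t\le T}\frac{1}{|t-s|^\alpha}\int_s^t \big(A + (B+\Gamma)|Y_u|\big)\,du \le T^{1-\alpha}\big(A + (B+\Gamma)\|Y\|_\infty\big)$$

almost surely, and

$$\mathbb{E}\exp\Big(4a_2\Big[\int_0^\cdot \big(\nabla V(Y_s) + \nabla W*\mu_s(Y_s)\big)\,ds\Big]_\alpha^2\Big) \le \exp\big(8a_2T^{2-2\alpha}A^2\big)\,\mathbb{E}\exp\big(8a_2T^{2-2\alpha}(B+\Gamma)^2\|Y\|_\infty^2\big)$$

which by step 1 is finite as soon as $8a_2T^{2-2\alpha}(B+\Gamma)^2 \le a_1$.

On the whole, $\mathbb{E}\exp(a_2[Y]_\alpha^2)$ is indeed finite for $a_2$ small enough, depending on $\mu_0$ only through a finite square-exponential moment, which concludes the argument. □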
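The bound used in step 2 on the Hölder seminorm of an antiderivative, namely at most $T^{1-\alpha}$ times the supremum of the integrand, can be illustrated on a grid. The sketch below is only a discrete analogue under stated assumptions: the seminorm is taken over grid points, the antiderivative is computed by the trapezoidal rule, and the names `holder_seminorm` and `antiderivative` are hypothetical.

```python
import numpy as np

def holder_seminorm(values, times, alpha):
    """Discrete Hölder seminorm: max over distinct grid points of |g(t) - g(s)| / |t - s|^alpha."""
    dv = np.abs(values[:, None] - values[None, :])
    dt = np.abs(times[:, None] - times[None, :])
    mask = dt > 0
    return float((dv[mask] / dt[mask] ** alpha).max())

def antiderivative(phi_values, times):
    """Trapezoidal antiderivative of phi on the grid, vanishing at times[0]."""
    increments = 0.5 * (phi_values[1:] + phi_values[:-1]) * np.diff(times)
    return np.concatenate(([0.0], np.cumsum(increments)))
```

With `times = np.linspace(0.0, T, 401)` and `phi = np.cos(7.0 * times)`, the value `holder_seminorm(antiderivative(phi, times), times, alpha)` stays below `T ** (1 - alpha) * np.abs(phi).max()`.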



Appendix. Metric entropy of a Hölder space

The aim of this appendix is to establish the bound used in the covering argument in the proof of the concentration theorem for independent paths, which amounts to studying the metric entropy of a Hölder space and of some related space of probability measures.

In the notation introduced in Sections 1.1 and 1.2 it follows from Ascoli's theorem that the closed ball $B^\alpha_R := B^\alpha_R([0,T],\mathbb{R}^d) = \{f \in \mathcal{C}^\alpha;\ \|f\|_\alpha \le R\}$ of center $0$ and radius $R$ in $\mathcal{C}^\alpha$ is a compact metric space for the metric defined by the uniform norm. Here we estimate by how many balls of given radius $r < R$ and centered in $B^\alpha_R$ the compact metric space $B^\alpha_R$ can be covered. We note that for $r \ge R$ the sole ball $\{f \in \mathcal{C}^\alpha;\ \|f\|_\infty \le r\}$ covers $B^\alpha_R$.

Notation: Given $r > 0$, the covering number $\mathcal{N}(E,r)$ of a compact metric space $(E,d)$ is defined as the infimum of the integers $n$ such that $E$ can be covered by $n$ balls centered in $E$ and of radius $r$ in $d$ metric. Then we have the following result which gives some lower and upper bounds on the covering number $\mathcal{N}(B^\alpha_R,r)$ and in our case makes more precise the bounds given for instance in the literature:

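Theorem 8. Given some integer number $d \ge 1$, some positive numbers $T$, $R$, $r$ and $\alpha$ with $r < R$ and $\alpha \le 1$, the covering number $\mathcal{N}(B^\alpha_R,r)$ of $B^\alpha_R$, equipped with the uniform norm, satisfies

$$\mathcal{N}(B^\alpha_R,r) \le \Big(10\sqrt{d}\,\frac{R}{r}\Big)^d\,3^{\,d\,5^{1/\alpha}\,T\,(\sqrt{d}\,R/r)^{1/\alpha}}.$$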
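If moreover, for instance, $r \le \dfrac{T^\alpha}{4T^\alpha+4}\,R$, then

$$\mathcal{N}(B^\alpha_R,r) \ge \Big(\frac{1}{4}\,\frac{R}{r}\Big)^d\,2^{\,d\,2^{-1/\alpha}\,T\,(R/r)^{1/\alpha}}.$$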

The lower bound ensures that the upper bound, on which the condition on the size of the sample in the main theorems depends, has the right order of growth in $R/r$.

Proof. 1. We start by establishing the upper bound.
1.1. We first consider the case when $d = 1$.

Given $J$ and $K$ some integers larger or equal to 1, we let $\tau = \dfrac{T}{J}$ and $\eta = \dfrac{R}{K}$, and then

$$t_j = \big(j - \tfrac12\big)\tau, \quad j \in \mathbb{N},\ 1 \le j \le J,$$
$$y_k = \big(k - \tfrac12\big)\eta, \quad k \in \mathbb{Z},\ -K+1 \le k \le K.$$

Then we cover the rectangle $[0,T]\times[-R,+R]$ in $\mathbb{R}_t\times\mathbb{R}_y$, which contains the graph of all functions in $B^\alpha_R([0,T],\mathbb{R})$, by a lattice with step $\tau$ in $t$-axis and $\eta$ in $y$-axis.

Then let $f$ be a given function in $B^\alpha_R([0,T],\mathbb{R})$. Since the intervals $[y_k - \frac{\eta}{2},\, y_k + \frac{\eta}{2}]$ cover the interval $[-R,+R]$, for every integer $j \in [1,J]$ there exists some integer $k(j) \in [-K+1,+K]$ such that

$$|f(t_j) - y_{k(j)}| \le \frac{\eta}{2}.$$

In particular

$$|y_{k(j+1)} - y_{k(j)}| \le \frac{\eta}{2} + |f(t_{j+1}) - f(t_j)| + \frac{\eta}{2} \le \eta + R\,|t_{j+1} - t_j|^\alpha \le \eta + R\,\tau^\alpha \le 2\eta$$

if we suppose $K\,T^\alpha \le J^\alpha$. But since the $y_k$ take values which are regularly distant of $\eta$, it follows that more precisely

$$|y_{k(j+1)} - y_{k(j)}| \le \eta.$$

From this map $k : [1,J]\cap\mathbb{N} \to [-K+1,K]\cap\mathbb{Z}$, we define the function $f_k : [0,T] \to [-R,+R]$ affine on each interval of the subdivision $(0,t_1,\dots,t_J,T)$ and such that

$$f_k(0) = f_k(t_1), \qquad f_k(t_j) = y_{k(j)} \ \ (1 \le j \le J), \qquad f_k(T) = f_k(t_J).$$

In particular we note that this function is Lipschitz with

$$\sup_{0\le t,s\le T}\frac{|f_k(t) - f_k(s)|}{|t-s|} = \sup_{1\le j\le J-1}\frac{|y_{k(j+1)} - y_{k(j)}|}{|t_{j+1} - t_j|} \le \frac{\eta}{\tau}$$

but that it does not necessarily belong to $B^\alpha_R([0,T],\mathbb{R})$.

The number of such functions is bounded by the number of $J$-tuples $(y_{k(j)})_{1\le j\le J}$ such that $|y_{k(j+1)} - y_{k(j)}| \le \eta$ for $1 \le j \le J-1$, that is, the number of $J$-tuples $(k(j))_{1\le j\le J}$ such that $|k(j+1) - k(j)| \le 1$ for $1 \le j \le J-1$. Such $J$-tuples are obtained by choosing $k(1)$ among $2K$ values, then $k(2)$ among 3 values for $-K+2 \le k(1) \le +K-1$ or 2 values for $k(1) = -K+1$ and $+K$, and so on. Hence there exist at most $2K\,3^{J-1}$ such functions $f_k$.
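The counting argument can be checked on small cases. The following sketch counts, by dynamic programming, the maps $k$ from $\{1,\dots,J\}$ into the $2K$ levels $\{-K+1,\dots,K\}$ whose consecutive values differ by at most 1, and compares the result with $2K\,3^{J-1}$; the function name `count_lattice_paths` is an illustrative choice.

```python
def count_lattice_paths(J, K):
    """Number of maps k: {1..J} -> {-K+1..K} with |k(j+1) - k(j)| <= 1."""
    levels = 2 * K
    counts = [1] * levels  # one admissible path of length 1 ending at each level
    for _ in range(J - 1):
        # a path ending at level i extends a path ending at level i-1, i or i+1
        counts = [
            sum(counts[m] for m in (i - 1, i, i + 1) if 0 <= m < levels)
            for i in range(levels)
        ]
    return sum(counts)
```

For `J = 2` and `K = 2` the two extreme levels have 2 successors and the two interior ones have 3, which gives 10 paths, below the bound $2K\,3^{J-1} = 12$.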

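If we now let $K$ be the smallest integer larger or equal to $4\,\dfrac{R}{r}$ and $J$ such that $K\,T^\alpha \le J^\alpha$, then

$$\|f - f_k\|_\infty \le \frac{r}{2}.$$

Indeed, given $t$ in $[0,T]$, there exists some integer number $j$ in $[1,J]$ such that $t$ belongs to $[t_j - \frac{\tau}{2},\, t_j + \frac{\tau}{2}]$, and

$$|f(t) - f_k(t)| \le |f(t) - f(t_j)| + |f(t_j) - f_k(t_j)| + |f_k(t_j) - f_k(t)|$$
$$\le R\,|t - t_j|^\alpha + |f(t_j) - y_{k(j)}| + \frac{\eta}{\tau}\,|t_j - t| \le R\Big(\frac{\tau}{2}\Big)^\alpha + \frac{\eta}{2} + \frac{\eta}{2} \le 2\eta \le \frac{r}{2}.$$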

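Hence we can cover $B^\alpha_R([0,T],\mathbb{R})$ by less than $2K\,3^{J-1}$ balls of radius $\dfrac{r}{2}$ of the metric space $C([0,T],\mathbb{R})$ equipped with the uniform norm, and if we let $J$ and $K$ be the smallest integers larger or equal to $5^{1/\alpha}\,T\,\Big(\dfrac{R}{r}\Big)^{1/\alpha}$ and $4\,\dfrac{R}{r}$ respectively, then $K\,T^\alpha \le J^\alpha$ holds true and

$$2K\,3^{J-1} \le 10\,\frac{R}{r}\,3^{\,5^{1/\alpha}\,T\,(R/r)^{1/\alpha}}.$$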
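The construction of step 1.1 can be sketched numerically. The code below builds, for a given function, the piecewise affine lattice function $f_k$ with the choices of $J$ and $K$ made above. It assumes lattice points $t_j = (j-\frac12)\tau$ and $y_k = (k-\frac12)\eta$, which is one reading of the definitions of step 1.1; the function name `lattice_approximation` is hypothetical.

```python
import math
import numpy as np

def lattice_approximation(f, T, R, r, alpha):
    """Return (f_k, J, K): the piecewise affine lattice function built from f."""
    K = math.ceil(4 * R / r)
    J = math.ceil(5 ** (1 / alpha) * T * (R / r) ** (1 / alpha))
    tau, eta = T / J, R / K
    t_nodes = (np.arange(1, J + 1) - 0.5) * tau      # assumed t_j = (j - 1/2) tau
    k = np.floor(f(t_nodes) / eta).astype(int) + 1   # f(t_j) lies in [(k-1) eta, k eta]
    k = np.clip(k, -K + 1, K)
    y = (k - 0.5) * eta                              # assumed y_k = (k - 1/2) eta

    def f_k(t):
        # np.interp is affine between nodes and constant outside [t_1, t_J]
        return np.interp(t, t_nodes, y)

    return f_k, J, K
```

For example, with `f = lambda t: 0.5 * np.sqrt(t)`, `T = 1`, `R = 1`, `r = 0.25` and `alpha = 0.5`, the function `f_k` stays within `r / 2` of `f` in uniform norm on a fine grid of `[0, T]`.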

1.2. From this we now deduce the upper bound in the general case $d \ge 1$.

Let $F$ be a given function in $B^\alpha_R([0,T],\mathbb{R}^d)$ with components $F_i \in B^\alpha_R([0,T],\mathbb{R})$ for $1 \le i \le d$. Let now $J$ and $K$ be the smallest integers larger or equal to $5^{1/\alpha}\,T\,\Big(\sqrt{d}\,\dfrac{R}{r}\Big)^{1/\alpha}$ and $4\sqrt{d}\,\dfrac{R}{r}$ respectively. With each $i$, we associate an integer $k_i$ in $[1,\,2K\,3^{J-1}]$ such that

$$\|F_i - f_{k_i}\|_\infty \le \frac{r}{2\sqrt{d}}$$

where the $f_k$ are the functions in $C([0,T],\mathbb{R})$ defined in the first step (relatively to $\dfrac{r}{\sqrt{d}}$ instead of $r$).

Then the function $F_{k_1,\dots,k_d}$ with components $f_{k_i}$ for $1 \le i \le d$ belongs to $C([0,T],\mathbb{R}^d)$ and satisfies $\|F - F_{k_1,\dots,k_d}\|_\infty \le \dfrac{r}{2}$. Moreover there are at most $(2K\,3^{J-1})^d$ such functions.

Consequently we can cover $B^\alpha_R([0,T],\mathbb{R}^d)$ by less than $(2K\,3^{J-1})^d$ balls of radius $\dfrac{r}{2}$ of the metric space $C([0,T],\mathbb{R}^d)$ equipped with the uniform norm, whence by less than $(2K\,3^{J-1})^d$ balls of radius $r$ of the metric space $B^\alpha_R([0,T],\mathbb{R}^d)$ equipped with the uniform norm.

This concludes the proof of the upper bound of the covering number $\mathcal{N}(B^\alpha_R([0,T],\mathbb{R}^d),r)$.

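2. We now turn to the lower bound.
2.1. We first consider the case $d = 1$.

We can give different types of lower bounds by considering special functions of the type $f_k$ defined in the first step. Here, for instance, we give the detail for one of them.

Given some non-zero integer $J$, we let $\tau = \dfrac{T}{J}$ and $\eta = \tau^\alpha R$, and then

$$t_j = \big(j - \tfrac12\big)\tau, \quad j \in \mathbb{N},\ 1 \le j \le J,$$

with $y_k = (k - \tfrac12)\eta$ as above. From a map $k : [1,J]\cap\mathbb{N} \to [0,1]\cap\mathbb{N}$, we define as above the function $f_k : [0,T] \to [y_0,y_1]$ affine on every interval of the subdivision $(0,t_1,\dots,t_J,T)$ and such that

$$f_k(0) = f_k(t_1), \qquad f_k(t_j) = y_{k(j)} \ \ (1 \le j \le J), \qquad f_k(T) = f_k(t_J).$$

Given some integer $\ell$ such that $-\tau^{-\alpha} + \frac12 \le \ell \le \tau^{-\alpha} - \frac12$, we define the function $f_{k\ell} : [0,T] \to [y_\ell,y_{\ell+1}]$ such that

$$f_{k\ell}(t) = f_k(t) + \ell\,\eta.$$

Then $f_{k\ell}$ belongs to $B^\alpha_R([0,T],\mathbb{R})$ and $\|f_{k\ell} - f_{k'\ell'}\|_\infty \ge \eta$ if $f_{k\ell} \ne f_{k'\ell'}$.

If for instance $r \le \inf(R,\,2^{-1}T^\alpha R)$ and $J+1$ is the smallest integer larger or equal to $2^{-1/\alpha}\,T\,\Big(\dfrac{R}{r}\Big)^{1/\alpha}$, then $\|f_{k\ell} - f_{k'\ell'}\|_\infty \ge 2r$ if $f_{k\ell} \ne f_{k'\ell'}$.

Thus we have found $L\,2^J$ elements in $B^\alpha_R([0,T],\mathbb{R})$ mutually distant of at least $2r$ in uniform norm, where $L$ is the number of integers $\ell$ between $-\tau^{-\alpha} + \frac12$ and $\tau^{-\alpha} - \frac12$. Hence

$$\mathcal{N}(B^\alpha_R([0,T],\mathbb{R}),r) \ge L\,2^J.$$

But

$$L \ge 2\Big(\big(\tau^{-\alpha} - \tfrac12\big) - 1\Big) + 1 = 2\,\tau^{-\alpha} - 2 \ge 2\Big(\Big(\frac{R}{2r}\Big)^{1/\alpha} - \frac{1}{T}\Big)^{\alpha} - 2 \ge \frac{R}{r} - \frac{2}{T^\alpha} - 2.$$

If moreover, for instance, $r \le \dfrac{T^\alpha}{4T^\alpha+4}\,R$, then $L \ge \dfrac{R}{2r}$ and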

$$\mathcal{N}(B^\alpha_R([0,T],\mathbb{R}),r) \ge \frac{1}{4}\,\frac{R}{r}\,2^{\,2^{-1/\alpha}\,T\,(R/r)^{1/\alpha}}.$$

2.2. From this we now deduce the lower bound in the general case $d \ge 1$.

The functions $F_{k_1\ell_1,\dots,k_d\ell_d}$ with components $f_{k_j\ell_j}$ for $j = 1,\dots,d$, where the $f_{k_j\ell_j}$ have been defined in step 2.1, belong to $B^\alpha_R([0,T],\mathbb{R}^d)$ and are mutually distant of at least $2r$ in uniform norm.

This concludes the argument for the lower bound of the number $\mathcal{N}(B^\alpha_R([0,T],\mathbb{R}^d),r)$. □
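The counting in step 2.1 can be illustrated numerically. The sketch below computes $J$ and the number $L$ of admissible shifts, assuming that the shifts are the integers $\ell$ with $|\ell| \le \tau^{-\alpha} - \frac12$ (one reading of the range in step 2.1); the function name `lower_bound_counts` is hypothetical.

```python
import math

def lower_bound_counts(T, R, r, alpha):
    """Return (J, L) for the family of L * 2**J well-separated functions of step 2.1."""
    x = 2 ** (-1 / alpha) * T * (R / r) ** (1 / alpha)
    J = math.ceil(x) - 1                 # J + 1 is the smallest integer >= x
    tau = T / J
    bound = tau ** (-alpha) - 0.5
    L = 2 * math.floor(bound) + 1        # integers l with |l| <= tau^(-alpha) - 1/2
    return J, L
```

With `T = 1`, `R = 1`, `alpha = 0.5` and `r = 0.09`, which satisfies $r \le \frac{T^\alpha}{4T^\alpha+4}R$, the lattice step $\eta = \tau^\alpha R$ is at least $2r$ and $L$ is at least $R/(2r)$.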

We now turn to the covering number of the corresponding space of probability measures: given a Polish metric space $(E,d)$, $p \ge 1$ and $\delta > 0$, we denote $\mathcal{N}_p(\mathcal{P}(E),\delta)$ the covering number of $\mathcal{P}(E)$ for the $W_p$ distance.

Then we have the following general result, which is known from earlier works:

Theorem 9. Let $(E,d)$ be a Polish metric space with finite diameter $D$, and let $p$ and $\delta$ be some real numbers with $p \ge 1$ and $0 < \delta < D$. Then the covering number $\mathcal{N}_p(\mathcal{P}(E),\delta)$ of $\mathcal{P}(E)$ satisfies

$$\mathcal{N}_p(\mathcal{P}(E),\delta) \le \Big(\frac{8eD}{\delta}\Big)^{p\,\mathcal{N}(E,\delta/2)}.$$

Note that if $\delta \ge D$ we simply have $\mathcal{N}_p(\mathcal{P}(E),\delta) = 1$ since the Wasserstein distance between any two probability measures on $E$ is at most $D$.

Since $B^\alpha_R$ equipped with the metric defined by the uniform norm is a Polish metric space with finite diameter $2R$, we deduce the following result:

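Theorem 10. Let $d \ge 1$, $p$, $T$, $R$, $\delta$ and $\alpha$ be some positive numbers with $p \ge 1$, $\delta < 2R$ and $\alpha \le 1$. Let also $B^\alpha_R = \{f \in \mathcal{C}^\alpha;\ \|f\|_\alpha \le R\}$ be equipped with the uniform norm. Then the space $\mathcal{P}(B^\alpha_R)$ of probability measures on $B^\alpha_R$ can be covered by $\mathcal{N}_p(\mathcal{P}(B^\alpha_R),\delta)$ balls of radius $\delta$ in Wasserstein distance $W_p$, with

$$\mathcal{N}_p(\mathcal{P}(B^\alpha_R),\delta) \le \big(16\,e\,R\,\delta^{-1}\big)^{\,p\,(20\sqrt{d}\,R\,\delta^{-1})^d\,3^{\,d\,5^{1/\alpha}\,T\,(2\sqrt{d}\,R\,\delta^{-1})^{1/\alpha}}}.$$

For $\delta \ge 2R$, we have

$$\mathcal{N}_p(\mathcal{P}(B^\alpha_R),\delta) = 1.$$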
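Since these covering bounds are astronomically large, they are best evaluated on a logarithmic scale. The sketch below computes the logarithm of an upper bound of the form $(10\sqrt{d}R/r)^d\,3^{d\,5^{1/\alpha}T(\sqrt{d}R/r)^{1/\alpha}}$ for the Hölder ball, and the iterated logarithm of the resulting bound $(16eR/\delta)^{p\,\mathcal{N}(B^\alpha_R,\delta/2)}$ for measures. Both formulas are taken here as assumptions, and the function names are hypothetical.

```python
import math

def log_covering_holder(d, T, R, r, alpha):
    """log of (10 sqrt(d) R/r)^d * 3^(d 5^(1/alpha) T (sqrt(d) R/r)^(1/alpha))."""
    ratio = math.sqrt(d) * R / r
    exponent = d * 5 ** (1 / alpha) * T * ratio ** (1 / alpha)
    return d * math.log(10 * ratio) + exponent * math.log(3)

def log_log_covering_measures(d, p, T, R, delta, alpha):
    """log(log(bound)) for the bound (16 e R / delta)^(p * N), N bounding N(B, delta/2)."""
    log_n = log_covering_holder(d, T, R, delta / 2, alpha)
    # log(bound) = p * N * log(16 e R / delta), hence the sum of three logarithms
    return math.log(p) + log_n + math.log(math.log(16 * math.e * R / delta))
```

For `d = 1`, `T = 1`, `R = 1`, `r = 0.5` and `alpha = 1`, the first function returns `log(20) + 10 * log(3)`, which makes explicit how fast the bound grows as the ratio $R/r$ increases or $\alpha$ decreases.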